Overview

Dataset Statistics

Number of Variables 10
Number of Rows 18581
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 4.4 MB
Average Row Size in Memory 248.5 B
Variable Types
  • Numerical: 5
  • Categorical: 4
  • DateTime: 1
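
The two memory figures above are linked by simple arithmetic: average row size times row count gives the total. A minimal sketch, assuming the report's "MB" means mebibytes (2**20 bytes); the report does not say so, but the numbers are consistent with that reading:

```python
# Figures copied from the Dataset Statistics above.
n_rows = 18581
avg_row_bytes = 248.5

total_bytes = n_rows * avg_row_bytes

# Assumption: "MB" in the report is binary (1 MiB = 2**20 bytes).
total_mib = total_bytes / 2**20
# For comparison, the same total in decimal megabytes (10**6 bytes).
total_mb_decimal = total_bytes / 1e6

print(f"{total_bytes:.0f} bytes = {total_mib:.1f} MiB = {total_mb_decimal:.1f} MB (decimal)")
```

Under the binary reading the total rounds to the reported 4.4; under the decimal reading it would round to 4.6.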

Dataset Insights

id is uniformly distributed
extra_features_count is skewed

Variables


id

numerical

Approximate Distinct Count 18581
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 297296
Mean 10026.6576
Minimum 0
Maximum 20097
Zeros 1
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • id is uniformly distributed
  • id is approximately symmetric (γ1 = 0.0053)

Quantile Statistics

Minimum 0
5-th Percentile 1011
Q1 5020
Median 10021
Q3 15014
95-th Percentile 19089
Maximum 20097
Range 20097
IQR 9994

Descriptive Statistics

Mean 10026.6576
Standard Deviation 5797.5769
Variance 3.3612e+07
Sum 1.8631e+08
Skewness 0.005265
Kurtosis -1.1948
Coefficient of Variation 0.5782
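
The "uniformly distributed" insight for id can be sanity-checked against the closed-form moments of a uniform distribution. The sketch below treats id as continuous uniform on [Minimum, Maximum], which is an approximation; the report does not state which test it applies.

```python
import math

# Figures copied from the id statistics above.
minimum, maximum = 0, 20097
reported_mean = 10026.6576
reported_std = 5797.5769
reported_kurtosis = -1.1948

# Closed-form moments of a continuous uniform distribution on [minimum, maximum].
expected_mean = (minimum + maximum) / 2
expected_std = (maximum - minimum) / math.sqrt(12)
expected_excess_kurtosis = -6 / 5

# Coefficient of variation is the standard deviation divided by the mean.
cv = reported_std / reported_mean

print(f"mean: expected {expected_mean:.1f}, reported {reported_mean}")
print(f"std: expected {expected_std:.1f}, reported {reported_std}")
print(f"excess kurtosis: expected {expected_excess_kurtosis}, reported {reported_kurtosis}")
print(f"coefficient of variation: {cv:.4f}")
```

The reported mean, standard deviation, and kurtosis all sit within about 1% of the uniform values, which fits the insight; a skewness near zero points the same way.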

make

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1324903
  • The most frequent value (Nissan) occurs over 38.45 times as often as the second most frequent value (Nissan Motor Egypt)

Length

Mean 6.3042
Standard Deviation 1.8862
Median 6
Minimum 6
Maximum 18

Sample

1st row Nissan
2nd row Nissan
3rd row Nissan
4th row Nissan
5th row Nissan

Letter

Count 116196
Lowercase Letter 96673
Space Separator 942
Uppercase Letter 19523
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Nissan, Nissan Motor Egypt) account for over 50.0% of rows
  • The most frequent word (nissan) occurs over 39.45 times as often as the second most frequent word (egypt)
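
The two ratios reported for make differ (38.45 and 39.45) because one counts whole values and the other counts individual words. A minimal sketch of both counts follows. The per-category row counts are inferred from the report rather than stated in it: 942 space separators at two spaces per "Nissan Motor Egypt" gives 471 rows, leaving 18110 rows of "Nissan".

```python
from collections import Counter

# Inferred counts (an assumption derived from the space-separator total above).
values = Counter({"Nissan": 18110, "Nissan Motor Egypt": 471})

# Word-level counts: lowercase each value, split on whitespace, weight by row count.
words = Counter()
for value, n in values.items():
    for word in value.lower().split():
        words[word] += n

def top_ratio(counter):
    """Ratio of the most frequent count to the second most frequent count."""
    (_, first), (_, second) = counter.most_common(2)
    return first / second

print(f"value-level ratio: {top_ratio(values):.2f}")
print(f"word-level ratio: {top_ratio(words):.2f}")
```

The word "nissan" appears in every row, so its count is one full category larger than the count of the value "Nissan", which is why the word-level ratio is exactly 1 higher.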

model

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1307319
  • The most frequent value (Sunny) occurs over 4.56 times as often as the second most frequent value (Qashqai)

Length

Mean 5.3578
Standard Deviation 0.7907
Median 5
Minimum 4
Maximum 14

Sample

1st row Juke
2nd row Juke
3rd row Juke
4th row Juke
5th row Juke

Letter

Count 99552
Lowercase Letter 80969
Space Separator 2
Uppercase Letter 18583
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Sunny, Qashqai) account for over 50.0% of rows
  • The most frequent word (sunny) occurs over 4.56 times as often as the second most frequent word (qashqai)

model_year

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 297296
Mean 2016.3714
Minimum 1999
Maximum 2023
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • model_year is skewed left (γ1 = -1.0447)

Quantile Statistics

Minimum 1999
5-th Percentile 2008
Q1 2014
Median 2017
Q3 2020
95-th Percentile 2022
Maximum 2023
Range 24
IQR 6

Descriptive Statistics

Mean 2016.3714
Standard Deviation 4.3296
Variance 18.7459
Sum 3.7466e+07
Skewness -1.0447
Kurtosis 1.6433
Coefficient of Variation 0.002147
  • model_year is not normally distributed (p-value ≈ 8.319e-04)
  • model_year has 311 outliers
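
Skewness (γ1) and kurtosis figures like the ones above are typically computed from central moments. The sketch below uses the population (biased) moment estimators and reports excess kurtosis; whether the report uses these exact estimators is an assumption, and the sample years are hypothetical.

```python
def moments_skew_kurtosis(xs):
    """Return (skewness g1, excess kurtosis) from population central moments."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    m4 = sum((x - mean) ** 4 for x in xs) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3

# Hypothetical model years: mostly recent, with a few old cars forming a left tail.
years = [1999, 2008, 2014, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2022, 2023]
skew, kurt = moments_skew_kurtosis(years)
print(f"skewness = {skew:.4f}, excess kurtosis = {kurt:.4f}")
```

A long tail of older cars pulls the third moment negative, which is the "skewed left" pattern the report flags for model_year.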

kilometers

numerical

Approximate Distinct Count 581
Approximate Unique (%) 3.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 297296
Mean 94271.8776
Minimum 0
Maximum 285000
Zeros 237
Zeros (%) 1.3%
Negatives 0
Negatives (%) 0.0%
  • kilometers is skewed right (γ1 = 0.2982)

Quantile Statistics

Minimum 0
5-th Percentile 9999
Q1 41000
Median 90000
Q3 139999
95-th Percentile 200000
Maximum 285000
Range 285000
IQR 98999

Descriptive Statistics

Mean 94271.8776
Standard Deviation 59994.5767
Variance 3.5993e+09
Sum 1.7517e+09
Skewness 0.2982
Kurtosis -0.7432
Coefficient of Variation 0.6364

transmission_type

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1371817
  • The most frequent value (Automatic) occurs over 16.55 times as often as the second most frequent value (Manual)

Length

Mean 8.829
Standard Deviation 0.6955
Median 9
Minimum 6
Maximum 9

Sample

1st row Automatic
2nd row Automatic
3rd row Automatic
4th row Automatic
5th row Automatic

Letter

Count 164052
Lowercase Letter 145471
Space Separator 0
Uppercase Letter 18581
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Automatic, Manual) account for over 50.0% of rows
  • The most frequent word (automatic) occurs over 16.55 times as often as the second most frequent word (manual)

price

numerical

Approximate Distinct Count 798
Approximate Unique (%) 4.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 297296
Mean 274666.3796
Minimum 0
Maximum 1384000
Zeros 13
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • price is skewed right (γ1 = 1.5939)

Quantile Statistics

Minimum 0
5-th Percentile 129000
Q1 181000
Median 248000
Q3 338000
95-th Percentile 513000
Maximum 1384000
Range 1384000
IQR 157000

Descriptive Statistics

Mean 274666.3796
Standard Deviation 129164.7997
Variance 1.6684e+10
Sum 5.1036e+09
Skewness 1.5939
Kurtosis 4.4912
Coefficient of Variation 0.4703
  • price is not normally distributed (p-value ≈ 3.864e-05)
  • price has 610 outliers
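
The report does not say how it counts outliers. One common rule is Tukey's fences, which flag values more than 1.5 × IQR outside the quartiles. The sketch below applies that rule to the price quartiles; both the rule and the multiplier are assumptions, not something the report confirms.

```python
# Quartiles copied from the price Quantile Statistics above.
q1, q3 = 181000, 338000
iqr = q3 - q1

# Tukey's rule with the conventional multiplier k = 1.5 (an assumption here).
k = 1.5
lower_fence = q1 - k * iqr
upper_fence = q3 + k * iqr

def is_outlier(price):
    """True if price falls outside the Tukey fences."""
    return price < lower_fence or price > upper_fence

print(f"IQR = {iqr}, fences = ({lower_fence}, {upper_fence})")
print(is_outlier(1384000), is_outlier(248000))
```

Under this rule the lower fence is negative, so only unusually high prices could be flagged; the upper fence lies above the 95-th Percentile, so fewer than 5% of rows would qualify.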

priced_at

datetime

Approximate Distinct Count 226
Approximate Unique (%) 1.2%
Missing 0
Missing (%) 0.0%
Memory Size 297296
Minimum 2022-02-02 00:00:00
Maximum 2023-04-30 00:00:00
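
The Minimum and Maximum above fix the calendar window that priced_at covers. A small sketch of the span calculation, using only the two dates shown:

```python
from datetime import date

# Minimum and maximum copied from the priced_at statistics above.
first = date(2022, 2, 2)
last = date(2023, 4, 30)

# Inclusive count of calendar days between the two dates.
span_days = (last - first).days + 1
print(span_days)
```

With roughly 226 distinct dates reported, about half of the calendar days in this window appear in the data.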

mileage_category

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 167722

Length

Mean 7.3559
Standard Deviation 1.7278
Median 8
Minimum 5
Maximum 9

Sample

1st row 200k+
2nd row 200k+
3rd row 0-50k
4th row 100k-150k
5th row 0-50k

Letter

Count 30900
Lowercase Letter 30900
Space Separator 0
Uppercase Letter 0
Dash Punctuation 17601
Decimal Number 87199
  • The top 2 categories (50k-100k, 0-50k) account for over 50.0% of rows
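
The Letter statistics classify each character by its Unicode general category. A minimal sketch using Python's standard unicodedata module on the five sample rows shown above; how the report itself performs the classification is an assumption.

```python
import unicodedata
from collections import Counter

# The five sample rows shown above for mileage_category.
sample = ["200k+", "200k+", "0-50k", "100k-150k", "0-50k"]

# Unicode general categories: Ll = lowercase letter, Lu = uppercase letter,
# Nd = decimal number, Pd = dash punctuation, Zs = space separator, Sm = math symbol.
counts = Counter(unicodedata.category(ch) for row in sample for ch in row)

for code, n in sorted(counts.items()):
    print(code, n)
```

The "+" in 200k+ falls under Sm (math symbol), a category the report does not list, so the listed categories need not add up to the total number of characters in the column.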

extra_features_count

numerical

Approximate Distinct Count 39
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 297296
Mean 12.4529
Minimum 1
Maximum 39
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • extra_features_count is skewed right (γ1 = 0.7881)

Quantile Statistics

Minimum 1
5-th Percentile 4
Q1 6
Median 9
Q3 18
95-th Percentile 26
Maximum 39
Range 38
IQR 12

Descriptive Statistics

Mean 12.4529
Standard Deviation 7.7952
Variance 60.7657
Sum 231387
Skewness 0.7881
Kurtosis -0.5413
Coefficient of Variation 0.626
  • extra_features_count is not normally distributed (p-value ≈ 5.949e-21)
  • extra_features_count has 34 outliers
